Optimisation of GMM in speaker recognition

نویسندگان

  • Robert P. Stapert
  • John S. D. Mason
  • Roland Auckenthaler
چکیده

Given that the amount of speaker speci c training data is always limited, for a given amount of data a speaker model has an optimum number of components. Here, this is investigated with regard to Gaussian mixture models (GMM) with and without world model adaption. Test results show that maximising the number of components in a speaker model can improve speaker recognition results. Comparisons with vector quantisation (VQ) indicate that sensible use of out-of-class data is essential for optimising a recognition system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Explicit modelling of session variability for speaker verification

This article describes a general and powerful approach to modelling mismatch in speaker recognition by including an explicit session term in the Gaussian mixture speaker modelling framework. Under this approach, the Gaussian mixture model (GMM) that best represents the observations of a particular recording is the combination of the true speaker model with an additional session-dependent offset...

متن کامل

Efficient Training of GMM Based Speaker Recognition System

Automatic speaker recognition (ASR) is based on speech feature vectors, models, and classifiers. To improve the speaker recognition performance, we must affect at least one of these modules. In this paper we propose to use subband spectral centroids (SSCs) as a complementary features with the traditional MFCC features, and a new GMM training algorithm, with the ultimate goal to search the bette...

متن کامل

Maximum Entropy Based Data Selection for Speaker Recognition

This paper presents the data selection method for speaker recognition. Since there is no promise that more data guarantee better results, the way of data selection becomes important. In the GMM-UBM speaker recognition, the UBM is trained to represent the speaker-independent distribution of acoustic features while the GMM speaker model is tailored for a specific speaker. In this study of data se...

متن کامل

Factor analysis subspace estimation for speaker verification with short utterances

Training the speaker and session subspaces is an integral problem in developing a joint factor analysis GMM speaker verification system. This work investigates and compares several alternative procedures for this task with a particular focus on training and testing with short utterances. Experiments show that better performance can be obtained when an independent rather than simultaneous optimi...

متن کامل

Automatic Speech Emotion and Speaker Recognition based on Hybrid GMM and FFBNN

In this paper we present text dependent speaker recognition with an enhancement of detecting the emotion of the speaker prior using the hybrid FFBN and GMM methods. The emotional state of the speaker influences recognition system. Mel-frequency Cepstral Coefficient (MFCC) feature set is used for experimentation. To recognize the emotional state of a speaker Gaussian Mixture Model (GMM) is used ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000